Extracting a largest redundancy-free XML storage structure from an acyclic hypergraph in polynomial time
Abstract
Given a hypergraph and a set of embedded functional dependencies, we investigate the conditions under which we can efficiently generate redundancy-free XML storage structures with as few scheme trees as possible. Redundancy-free XML structures guarantee both economy in storage space and the absence of update anomalies, and having the fewest scheme trees means that the fewest joins are needed to navigate among the data elements. We know that the general problem is intractable. The problem may still be intractable even when the hypergraph is acyclic and each hyperedge is in Boyce-Codd Normal Form (BCNF). As we show here, however, given an acyclic hypergraph with each hyperedge in BCNF, a polynomial-time algorithm exists that generates a largest possible redundancy-free XML storage structure. Successively generating largest possible scheme trees from among the hyperedges not already included in generated scheme trees constitutes a reasonable heuristic for finding the fewest possible scheme trees. For many practical cases, this heuristic finds the set of redundancy-free XML storage structures with the fewest scheme trees. In addition to a correctness proof and a complexity analysis showing that the algorithm is polynomial, we also give experimental results over randomly generated but appropriately constrained hypergraphs that confirm empirically that the algorithm runs in polynomial time.
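The heuristic described in the abstract (repeatedly extract a largest scheme tree from the hyperedges not yet covered) can be sketched as a simple greedy loop. The sketch below is illustrative only and makes several assumptions: the names greedy_scheme_trees and largest_connected_group are hypothetical, a "scheme tree" is represented merely as the set of hyperedges it covers, and the extractor shown is a toy stand-in (largest group of hyperedges connected through shared attributes), not the paper's polynomial-time algorithm, which the abstract does not spell out.

```python
from typing import Callable, FrozenSet, Iterable, List, Set

Hyperedge = FrozenSet[str]   # a hyperedge is a set of attribute names
Tree = Set[Hyperedge]        # assumption: a tree is just the hyperedges it covers


def greedy_scheme_trees(
    hyperedges: Iterable[Hyperedge],
    extract_largest: Callable[[Set[Hyperedge]], Tree],
) -> List[Tree]:
    """Repeatedly take a largest tree from the hyperedges not yet covered."""
    remaining = set(hyperedges)
    forest: List[Tree] = []
    while remaining:
        tree = extract_largest(remaining)
        # The extractor must make progress and use only uncovered hyperedges.
        if not tree or not tree <= remaining:
            raise ValueError("extractor must return a non-empty subset")
        forest.append(tree)
        remaining -= tree
    return forest


def largest_connected_group(edges: Set[Hyperedge]) -> Tree:
    """Toy stand-in extractor (NOT the paper's algorithm): the largest group
    of hyperedges linked to one another through shared attributes."""
    unvisited = set(edges)
    best: Tree = set()
    while unvisited:
        start = unvisited.pop()
        group = {start}
        frontier = [start]
        while frontier:
            edge = frontier.pop()
            linked = {other for other in unvisited if edge & other}
            unvisited -= linked
            group |= linked
            frontier.extend(linked)
        if len(group) > len(best):
            best = group
    return best


if __name__ == "__main__":
    edges = [frozenset("AB"), frozenset("BC"), frozenset("DE")]
    for tree in greedy_scheme_trees(edges, largest_connected_group):
        print(sorted("".join(sorted(edge)) for edge in tree))
```

Passing the extractor as a parameter keeps the greedy loop separate from the hard part of the problem: any procedure that returns a largest redundancy-free scheme tree over the remaining hyperedges could be substituted for the toy function.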
Related resources
Generating the fewest redundancy-free scheme trees from acyclic conceptual-model hypergraphs in polynomial time
Generating the fewest redundancy-free scheme trees from conceptual-model hypergraphs is NP-hard [11]. We show, however, that the problem has a polynomial-time solution if the conceptual-model hypergraph is acyclic. We define conceptual-model hypergraphs, cycles, and scheme trees, and then present a polynomial-time algorithm and show that it generates the fewest redundancy-free scheme trees. As ...
On an Enhancement of XML Applied for Mobile E-Commerce
Extensible Markup Language (XML) is a textual markup language that is becoming increasingly important in Internet web services. However, XML has some distinct disadvantages, such as its inherent redundancy, which consumes a great deal of the limited network bandwidth, especially in mobile computing. Considering the characteristics of mobile commerce, the handsets' memory capability and data...
RRXF: Redundancy reducing XML storage in relations
Current techniques for storing XML using relational technology consider the structure of an XML document but ignore its semantics as expressed by keys or functional dependencies. However, when the semantics of a document are considered, redundancy may be reduced, node identifiers removed where value-based keys are available, and semantic constraints validated using relational primary key technol...
A Negative Conjunctive Query is Easy if and only if it is Beta-Acyclic
It is known that the data complexity of a Conjunctive Query (CQ) is determined only by the way its variables are shared between atoms, reflected by its hypergraph. In particular, Yannakakis [18, 3] proved that a CQ is decidable in linear time when it is α-acyclic, i.e. its hypergraph is α-acyclic; Bagan et al. [2] even state: Any CQ is decidable in linear time iff it is α-acyclic. (under certai...
Designing and Querying a Compact Redundancy Free XML Storage
XML, the universal format for data exchange, has seen a phenomenal increase in database size, creating the need for compact storage coupled with simple accessibility. In previous works, all XML databases are implemented as a tree-based structure, which leads to increased space complexity. The proposed structure, called RFX (Redundancy Free XML storage structure), addresses this issue by...
Journal: Inf. Syst.
Volume: 35
Publication date: 2010